Loop-length-dependent SVM prediction of domain linkers for high-throughput structural proteomics.

نویسندگان

  • Teppei Ebina
  • Hiroyuki Toh
  • Yutaka Kuroda
چکیده

The prediction of structural domains in novel protein sequences is becoming of practical importance. One important area of application is the development of computer-aided techniques for identifying, at a low cost, novel protein domain targets for large-scale functional and structural proteomics. Here, we report a loop-length-dependent support vector machine (SVM) prediction of domain linkers, which are loops separating two structural domains. (DLP-SVM is freely available at: http://www.tuat.ac.jp/ approximately domserv/cgi-bin/DLP-SVM.cgi.) We constructed three loop-length-dependent SVM predictors of domain linkers (SVM-All, SVM-Long and SVM-Short), and also built SVM-Joint, which combines the results of SVM-Short and SVM-Long into a single consolidated prediction. The performances of SVM-Joint were, in most aspects, the highest, with a sensitivity of 59.7% and a specificity of 43.6%, which indicated that the specificity and the sensitivity were improved by over 2 and 3% respectively, when loop-length-dependent characteristics were taken into account. Furthermore, the sensitivity and specificity of SVM-Joint were, respectively, 37.6 and 17.4% higher than those of a random guess, and also superior to those of previously reported domain linker predictors. These results indicate that SVMs can be used to predict domain linkers, and that loop-length-dependent characteristics are useful for improving SVM prediction performances.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fatigue Life Prediction of Rivet Joints

Strength reduction in structures like an aircraft could be resulted as cyclic loads over a period of time and is an important factor for structural life prediction. Service loads are emphasized at the regions of stress concentration, mostly at the connection of components. The initial flaw prompting the service life was expected by using the Equivalent Initial Flaw Size (EIFS) which has been re...

متن کامل

DFLpred: High-throughput prediction of disordered flexible linker regions in protein sequences

MOTIVATION Disordered flexible linkers (DFLs) are disordered regions that serve as flexible linkers/spacers in multi-domain proteins or between structured constituents in domains. They are different from flexible linkers/residues because they are disordered and longer. Availability of experimentally annotated DFLs provides an opportunity to build high-throughput computational predictors of thes...

متن کامل

Inferring Protein Interaction Network by Boosting Algorithm

One of major goals of functional genomics is to elucidate protein interaction networks for whole organisms. Determining protein interactions provides not only detailed functional insights on characterized proteins, but also an information base for identifying biological complexes and metabolic or signal transduction pathways [1]. The recent emergence of high-throughput proteomics techniques has...

متن کامل

Protein-protein interaction map inference using interacting domain profile pairs

UNLABELLED A number of predictive methods have been designed to predict protein interaction from sequence or expression data. On the experimental front, however, high-throughput proteomics technologies are starting to yield large volumes of protein-protein interaction data. High-quality experimental protein interaction maps constitute the natural dataset upon which to build interaction predicti...

متن کامل

Machine Learning Structural and Functional Proteomics

While new high-throughput experimental techniques are being developed for proteomics applications (e.g. mass spectrometry, protein chips), it is clear that given the fundamental importance of proteins to biology, biotechnology, and medicine, computer methods that can rapidly sift through massive amounts of data and help determine the structure and function of a large number of proteins in a giv...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Biopolymers

دوره 92 1  شماره 

صفحات  -

تاریخ انتشار 2009